Error-correcting Output Coding for Text Classiication

نویسنده

  • Adam Berger
چکیده

This paper applies error-correcting output coding (ECOC) to the task of document cate-gorization. ECOC, of recent vintage in the AI literature, is a method for decomposing a multiway classiication problem into many binary classiication tasks, and then combining the results of the subtasks into a hypothesized solution to the original problem. There has been much recent interest in the machine learning community about algorithms which integrate \advice" from many subordinate predic-tors into a single classiier, and error-correcting output coding is one such technique. We provide experimental results on several real-world datasets, extracted from the Internet, which demonstrate that ECOC can ooer signiicant improvements in accuracy over conventional classiication algorithms.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Error - Correcting Output

This paper applies error-correcting output coding (ECOC) to the task of document cate-gorization. ECOC, of recent vintage in the AI literature, is a method for decomposing a multiway classiication problem into many binary classiication tasks, and then combining the results of the subtasks into a hypothesized solution to the original problem. There has been much recent interest in the machine le...

متن کامل

Error-Correcting Output Coding Corrects Bias and Variance

Previous research has shown that a technique called error-correcting output coding (ECOC) can dramatically improve the classiication accuracy of supervised learning algorithms that learn to classify data points into one of k 2 classes. This paper presents an investigation of why the ECOC technique works, particularly when employed with decision-tree learning algorithms. It shows that the ECOC m...

متن کامل

Bias , Variance , and Error Correcting Output Codes forLocal Learners ?

This paper focuses on a bias variance decomposition analysis of a local learning algorithm, the nearest neighbor classiier, that has been extended with error correcting output codes. This extended algorithm often considerably reduces the 0-1 (i.e., classiication) error in comparison with nearest neighbor (Ricci & Aha, 1997). The analysis presented here reveals that this performance improvement ...

متن کامل

Error-Correcting Output Coding for Text Classification

This paper applies error-correcting output coding (ECOC) to the task of document categorization. ECOC, of recent vintage in the AI literature, is a method for decomposing a multiway classification problem into many binary classification tasks, and then combining the results of the subtasks into a hypothesized solution to the original problem. There has been much recent interest in the machine l...

متن کامل

Extending Local Learners with Error-correcting Output Codes Extending Local Learners with Error-correcting Output Codes

Error-correcting output codes (ECOCs) represent classes with a set of output bits, where each bit encodes a binary classiication task corresponding to a unique partition of the classes. Algorithms that use ECOCs learn the function corresponding to each bit, and combine them to generate class predictions. ECOCs can reduce both variance and bias errors for multiclass classiication tasks when the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999